Twitter Sentiment Analysis: Lexicon Method, Machine Learning Method and Their Combination

نویسندگان

  • Olga Kolchyna
  • Tharsis T. P. Souza
  • Philip Treleaven
  • Tomaso Aste
چکیده

This paper presents a step-by-step methodology for Twitter sentiment analysis. Two approaches are tested to measure variations in the public opinion about retail brands. The first, a lexicon-based method, uses a dictionary of words with assigned to them semantic scores to calculate a final polarity of a tweet, and incorporates part of speech tagging. The second, machine learning approach, tackles the problem as a text classification task employing two supervised classifiers Naive Bayes and Support Vector Machines. We show that combining the lexicon and machine learning approaches by using a lexicon score as a one of the features in Naive Bayes and SVM classifications improves the accuracy of classification by 5%.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A High-Performance Model based on Ensembles for Twitter Sentiment Classification

Background and Objectives: Twitter Sentiment Classification is one of the most popular fields in information retrieval and text mining. Millions of people of the world intensity use social networks like Twitter. It supports users to publish tweets to tell what they are thinking about topics. There are numerous web sites built on the Internet presenting Twitter. The user can enter a sentiment ta...

متن کامل

Review of Twitter sentiment analysis

Twitter data has recently been considered to perform a large variety of advanced analysis. Analysis of Twitter data imposes new challenges because the data distribution is intrinsically sparse, due to a large number of messages post every day by using a wide vocabulary. Sentiment Analysis task is divided in two steps: Feature selection methods and Sentiment classification methods. Feature selec...

متن کامل

Improving Sentiment Analysis Through Ensemble Learning of Meta-level Features

In this research, the well-known microblogging site, Twitter, was used for a sentiment analysis investigation. We propose an ensemble learning approach based on the meta-level features of seven existing lexicon resources for automated polarity sentiment classification. The ensemble employs four base learners (a Two-Class Support Vector Machine, a Two-Class Bayes Point Machine, a Two-Class Logis...

متن کامل

Hybrid Based Approach to Enhance the Accuracy of Sentiment Analysis on Tweets

The opinion of others toward a product or event plays an important role in the decision-making process. In recent times, the explosion of social media over the web has a rich impact on an individual’s and the organization’s decisionmaking process about certain content. Twitter which is a leading micro-blogging website allows the people to post their opinions, state of mind, or status toward pro...

متن کامل

Forecasting Stock Price Movements Based on Opinion Mining and Sentiment Analysis: An Application of Support Vector Machine and Twitter Data

Today, social networks are fast and dynamic communication intermediaries that are a vital business tool. This study aims at examining the views of those involved with Facebook stocks so that we can summarize their views to predict the general behavior of this stock and collectively consider possible Facebook stock price movements, and create a more accurate pattern compared to previous patterns...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015